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1.  Executive  Summary 

The  proposed  researeh  effort  builds  on  and  extends  the  work  of  the  previous  ONR-funded 
“Validation  Coverage  Toolkit  for  HSCB  Models”  projeet.  The  overall  objeetives  of  the  on¬ 
going  researeh  program  are: 

•  Help  scientists  create,  analyze,  refine,  and  validate  rich  scientific  models 

•  Help  computational  scientists  verify  the  correctness  of  their  implementations  of  those 
models 

•  Help  users  of  scientific  models,  including  decision  makers  within  the  US  Navy,  to  use 
those  models  correctly  and  with  confidence 

•  Use  a  combination  of  human-driven  data  visualization  and  analysis,  automated  data 
analysis,  and  machine  learning  to  leverage  human  expertise  in  model  building  with 
automated  analyses  of  complex  models  against  large  datasets 

Specific  objectives  for  the  current  effort  include: 

•  Fluid  temporal  correlation  analysis.  Our  objective  is  to  design  a  new  method  for 
performing  temporally  fluid  correlation  analysis  for  temporal  sets  of  data  and 
implement  the  method  as  a  new  prototype  component  within  the  Model  Analyst’s 
Toolkit  (MAT)  software  application. 

•  Automated  suggestions  for  model  construction  and  refinement.  Our  objective  is  to 
design  and  implement  a  prototype  mechanism  that  learns  from  data  how  factors  interact 
in  non-trivial  ways  in  scientific  models. 

•  Data  validation  and  repair.  Our  objective  is  to  design  and  implement  a  prototype 
capability  to  identify  likely  errors  in  data  based  on  anomalies  relative  to  historic  data 
and  to  use  models  of  historic  data  to  offer  suggested  repairs. 

•  System  prototyping.  Our  objective  is  to  incorporate  all  improvements  into  the  MAT 
software  application  and  make  the  resulting  application  available  to  the  government  and 
academic  research  community  for  use  in  scientific  modeling  projects. 

•  Evaluation  of  applicability  to  multiple  scientific  domains.  Our  objective  is  to  ensure 
(and  demonstrate)  that  MAT  can  be  applied  to  a  wide  range  of  scientific  domains  by 
identifying  and  building  at  least  one  neurological  and/or  physiological  model  and 
analyze  the  associated  data  with  MAT,  making  any  extensions  to  the  MAT  tool  that  are 
needed  to  support  the  analysis  of  such  a  model. 

2.  Overview  of  Problem  and  Technical  Approach 

2.1 .  Summary  of  the  Problem 

One  of  the  most  powerful  things  scientists  can  do  is  to  create  models  that  describe  the  world 
around  us.  Models  help  scientists  organize  their  theories  and  suggest  additional  experiments  to 
run.  Validated  models  also  help  others  in  more  practical  applications.  For  instance,  in  the  hands 
of  military  decision  makers,  human  social  cultural  behavior  (HSCB)  models  can  help  predict 
instability  and  the  socio-political  effects  of  missions,  whereas  models  of  the  human  brain  and 
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mind  can  help  educators  and  trainers  create  curricula  that  more  effectively  improve  the 
knowledge,  skills,  and  abilities  of  their  pupils. 

While  there  are  various  software  tools  that  are  used  by  the  scientific  community  to  help  them 
develop  and  analyze  their  models  (e.g..  Excel,  R,  Simulink,  Matlab),  they  are  largely  so  general 
in  purpose  (e.g..  Excel,  R)  or  so  focused  on  computational  models  in  particular  (e.g.,  Simulink, 
Matlab),  that  they  are  not  ideal  for  rapid  model  exploration  or  for  use  by  non-computational 
scientists.  They  also  largely  ignore  the  problem  of  validating  the  models,  especially  when  the 
models  are  positing  causal  claims  as  most  interesting  scientific  models  do.  To  address  this  gap, 
Charles  River  Analytics  undertook  the  “Validation  Coverage  Toolkit  for  HSCB  Models” 
project  with  ONR.  Under  this  effort,  we  successfully  designed,  implemented,  informally 
evaluated,  and  deployed  a  tool  called  the  Model  Analyst’s  Toolkit  (MAT),  which  focused  on 
supporting  social  scientists  to  visualize  and  explore  data,  develop  causal  models,  and  validate 
those  models  against  available  data  (Neal  Reilly,  2010;  Neal  Reilly,  Pfeffer,  Barnett  et  ah, 
2011,2010). 

As  part  of  the  development  of  the  MAT  tool,  we  identified  four  important  extensions  to  that 
research  program  that  would  further  support  the  scientific  modeling  process: 

•  Correlation  analyses  are  still  the  standard  way  of  identifying  relationships  between 
factors  in  a  model,  but  correlations  are  fundamentally  flawed  as  a  tool  for  analyzing 
potentially  causal  or  predictive  relationships  as  they  assume  instantaneous  effects.  Even 
performing  correlation  analyses  with  a  temporal  offsets  between  streams  of  data  is 
insufficient  as  the  temporal  gap  between  the  causal  or  predictive  event  and  the 
following  event  may  not  be  the  same  every  time  (either  because  of  variability  in  the 
system  being  modeled  or  because  of  variability  introduced  by  a  fixed  sampling  rate). 
What  we  need  is  a  novel  way  of  evaluating  the  true  predictive  power  across  streams  of 
data  that  can  deal  with  fluid  offsets  between  changes  in  one  stream  of  data  and  follow 
events  in  the  other  stream  of  data. 

•  Modeling  complex  phenomena  is  a  fundamentally  difficult  task.  Human  intuition  and 
analysis  is  by  far  the  most  effective  way  of  performing  this  task,  but  even  humans  can 
be  overwhelmed  by  the  complexity  of  modeling  the  systems  they  are  studying  (e.g., 
socio-political  system,  human  neurophysiology).  Automated  tools,  while  not  especially 
good  at  generating  reasonable  scientific  hypotheses,  are  extremely  good  at  processing 
large  amounts  of  data.  We  believe  there  is  an  opportunity  for  computational  systems  to 
enhance  human  scientific  inquiry.  Under  the  “Validation  Coverage  Toolkit  for  HSCB 
Models”  project,  we  demonstrated  how  automated  tools  could  help  human  scientists  to 
analyze  and  validate  their  models  against  data.  We  believe  a  similar  approach  can  be 
used  to  help  suggest  modifications  to  the  human-built  models  to  make  them  better 
match  the  available  data.  To  be  useful,  however,  such  automated  analyses  will  need  to 
be  rich  enough  to  suggest  subtle  data  interactions  that  are  most  likely  to  be  missed  by 
the  human  scientist.  Eor  instance,  correlations  (especially  correlations  that  take  into 
account  fluid  temporal  displacements)  could  be  used  to  identify  likely  relationships 
between  streams  of  data,  but  such  an  approach  would  miss  complex,  non-linear 
relationships  between  interrelated  factors  that  cannot  be  effectively  analyzed  with 
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simple  two-way  correlations.  For  instance,  if  crime  waves  are  associated  with  increases 
in  unemployment  or  drops  in  the  police  presence,  that  would  be  hard  to  identify  with  a 
correlation  analysis.  We  need  richer  automated  data  analysis  techniques  that  can  extract 
complex,  non-linear,  multi-variable  relationships  between  data  if  we  are  to  effectively 
suggest  model  improvements  to  human  scientists. 

•  Even  if  a  scientific  model  is  sound,  if  the  data  sets  provided  as  inputs  to  the  model  are 
unreliable,  the  results  of  the  model  are  still  suspect.  And,  unfortunately,  data  will  often 
be  wrong.  For  instance,  HSCB  surveys  are  notoriously  unreliable  and  biased  for  a 
variety  of  reasons,  and  neurological  and  physiological  data  can  be  corrupted  by  broken 
or  improperly  used  sensors.  If  it  were  possible  to  identify  when  data  was  unreliable  and, 
ideally,  even  repair  the  data,  then  the  models  that  are  using  the  data  could  once  again  be 
effectively  used. 

•  The  MAT  tool  we  developed  under  the  “Validation  Coverage  Toolkit  for  HSCB 
Models”  project  was  focused  primarily  on  assisting  social  scientists  in  the  analysis, 
refinement,  and  validation  of  HSCB  models.  In  parallel  with  that  effort,  however,  we 
also  took  an  opportunity  to  apply  MAT  to  evaluating  neurological  and  physiological 
data  under  the  DARPA-funded  CRANIUM  (Cognitive  Readiness  Agents  for  Neural 
Imaging  and  Understanding  Models)  program.  We  discovered  the  generality  of  the 
MAT  tool  makes  it  potentially  applicable  to  a  great  number  of  different  scientific 
domains.  MAT  proved  to  be  a  useful,  but  peripheral  tool,  in  CRANIUM.  We  believe 
MAT  could  be  applied  to  a  broader  suite  of  scientific  modeling  problems  than  it  has 
been  so  far. 

2.2.  Summary  of  our  Approach 

To  address  these  identified  gaps  and  opportunities,  we  are  extending  MAT’s  support  for  model 
development,  analysis,  refinement,  and  validation;  enhancing  MAT  to  analyze  and  repair  data; 
and  demonstrating  MATs  usefulness  in  additional  scientific  modeling  domains.  Our  approach 
encompasses  the  following  four  areas,  which  correspond  to  the  four  gaps/opportunities 
identified  in  the  previous  section: 

•  Temporally  Fluid  Correlation  Analysis.  We  are  designing  a  new  method  to  perform 
Temporally  Fluid  Correlational  Analysis  on  temporal  sets  of  data,  and  we  are 
implementing  the  method  as  a  new  component  within  the  MAT  software  application. 
The  version  of  MAT  at  the  beginning  of  the  new  effort  supported  correlation  analysis 
for  temporally  offset  data;  it  shifts  the  two  data  streams  being  compared  by  a  fixed 
offset  that  is  based  on  the  sampling  rate  of  the  data  (i.e.,  data  that  is  sampled  annually 
will  be  shifted  by  one  year  at  a  time),  performs  a  standard  correlation  on  the  shifted 
data,  plots  the  correlation  value  against  the  amount  of  the  offset,  and  then  repeats  the 
process  for  the  next  offset  amount.  If  two  data  streams  are  shifted  by  a  fixed  offset  (e.g., 
changes  in  one  stream  are  always  followed  by  a  comparable  value  in  the  other  stream 
after  a  fixed  time),  then  this  method  will  find  that  offset.  Under  the  current  effort,  we 
are  expanding  on  this  capability  to  support  fluid  temporal  shifts  within  the  data  streams. 
That  is,  we  are  making  it  possible  to  identify  when  the  temporal  offset  between  the 
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change  in  the  first  data  stream  and  its  effeet  in  the  seeond  stream  is  not  a  statie  amount 
of  time. 

•  Automated  suggestions  for  model  construction  and  refinement.  We  are  designing 
and  implementing  a  meehanism  to  learn  how  faetors  interaet  in  non-trivial  ways  in 
scientifie  models.  In  partieular,  we  are  developing  a  method  for  learning  disjunets, 
eonjunets,  and  negations.  This  meehanism  starts  with  the  model  developed  by  the 
scientist  user  and  make  recommendations  for  possible  adjustments  to  make  it  more 
complete  by  performing  statistical  data  mining  and  machine  learning. 

•  Data  validation  and  repair.  Recognizing  that  data  contains  errors  is  plausible  once  we 
understand  the  relationships  between  data  sets.  That  is,  if  we  are  able  to  develop  models 
of  the  correlations  between  sets  of  data,  then  we  can  build  systems  that  notice  when 
these  correlations  do  not  hold  in  new  data,  indicating  possible  errors  in  data.  For 
instance,  if  we  know  that  public  sentiment  tends  to  vary  similarly  between  nearby 
towns,  then  when  one  town  shows  anomalous  behavior,  we  can  reasonably  suspect 
problems  with  the  data.  There  might  be  local  issues  that  cause  the  anomaly,  but  it  is,  at 
least,  worth  noting  and  bringing  to  the  attention  of  the  user  of  the  data  and  model.  As 
MAT  is  designed  to  help  analyze  models  and  recognize  inter-data  relationships,  it  is 
primed  to  perform  exactly  this  analysis.  Existing  methods  perform  similar  types  of 
analysis  for  environmental  data  (Dereszynski  &  Dietterich,  2007,  2011).  For  instance,  a 
broken  thermometer  can  be  identified  and  the  data  from  it  even  estimated  by  looking  at 
the  temperature  readings  of  nearby  thermometers,  which  will  generally  be  highly 
correlated. 

•  Application  to  multiple  scientific  modeling  domains.  To  ensure  (and  demonstrate) 
that  MAT  can  be  applied  to  a  wide  range  of  scientific  domains,  we  are  identifying  and 
building  at  least  one  neurological  and/or  physiological  model  and  analyzing  the 
associated  data  with  MAT,  making  any  extensions  to  the  MAT  tool  that  are  needed  to 
support  the  analysis  of  such  a  model.  The  initial  MAT  effort  focused  on  HSCB  models; 
by  focusing  this  effort  on  harder-science  models  at  much  shorter  time  durations,  we 
believe  we  can  effectively  evaluate  an  interesting  range  of  applications  of  the  MAT 
tool. 

3.  Current  Activities  and  Status 

During  the  current  reporting  period,  we  made  progress  on  the  causal  model  recommendation 
component  and  the  new  data  synthesis  component.  We  also  submitted  a  full  paper  to  the 
American  Political  Science  Association’s  annual  meeting  and  have  been  invited  to  present  the 
paper  as  part  of  a  panel  on  Information  Technologies  in  Politics  and  Political  Science.  We 
describe  these  results  in  more  detail  in  this  Section  and  the  next  Section. 

3.1.  Causal  Model  Recommendation 

In  the  previous  version  of  MAT,  the  causal  model  recommender  generated  candidate  causal 
models  based  on  data  features  provided  by  the  user  and  presented  the  models  that  are  on  the 
Pareto  Frontier  to  the  user.  The  new  version  of  the  recommender  can  now  automatically 
extract  features  from  the  user’s  data  and  generate  possible  causal  models  using  those  features. 
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The  automatic  feature  extraction  is  explained  in  more  detail  in  the  following  section,  but  it 
looks  for  and  categorizes  interesting  features  in  the  data.  These  automatically  generated 
features  and  the  user  supplied  features  can  together  be  used  to  generate  candidate  causal 
models.  If  a  causal  model  explains  the  phenomenon  under  study  well,  then  the  user  can  add  the 
causal  model  and  the  automatically  generated  features  to  the  MAT  project  document  for  further 
examination.  This  new  functionality  allows  the  discovery  of  causal  models  using  data  features 
that  the  user  may  have  previously  been  unaware  of  The  following  example  will  help  illustrate 
the  usefulness  of  this  functionality. 

Figure  1  shows  a  simple  example  where  the  user  is  trying  to  find  the  cause  of  an  increase  in 
violent  crimes.  In  this  case,  the  user  provided  data  features  where  there  was  an  increase  in 
unemployment  and  found  a  causal  model  that  explained  all  of  the  periods  of  increased  violent 
crime. 


Figure  1.  Initial  results  of  user-defined  causal  model  analysis 

However,  the  causal  model  is  not  entirely  satisfactory  due  to  the  single  cause  having  two 
effects  with  a  period  of  low  violent  crime  between  the  two  effects.  In  the  list  of  candidate 
causal  models  (i.e.,  the  Pareto  Frontier),  some  recommendations  use  features  not  provided  by 
the  user.  For  example.  Figure  2  shows  a  model  that  uses  automatically  generated  features  where 
there  is  a  dramatic  decrease  in  the  size  of  the  police  force  that  had  not  been  explicitly  identified 
as  an  event  of  interest  by  the  user.  This  causal  model  also  explains  all  periods  of  increased 
crime,  but  does  not  have  a  causal  link  extending  across  multiple  effects.  Furthermore,  the 
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temporal  delay  between  high  unemployment  and  high  crime  decreased  from  6  years  in  the 
previous  model  to  1  year  in  this  one. 


Figure  2.  Improved  causal  model  fouud  usiug  automatically  ideutified  feature  eveuts 

3.2.  Data  Synthesis 

The  new  MAT  data  synthesis  capability  was  completed  and  tested  during  the  current  reporting 
period.  The  ability  to  synthesize  new  data  from  existing  data  using  basic  arithmetic  operations 
was  a  top  feature  request  by  our  users.  The  capability  uses  the  Graphler  data  model  as  its 
underlying  graphical  and  data  support.  Users  who  are  familiar  with  using  the  existing  modeling 
system  rapidly  learn  how  to  use  the  synthesis  process  which  is  similar.  Figure  3  shows  the 
resulting  screen. 
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Figure  3.  New  data  synthesis  capability  in  MAT 

The  screen  is  a  new  tab  in  the  system  which  is  similar  to  the  modeling  (Graphler)  screen.  Inputs 
are  dragged  onto  the  canvas  from  the  data  series  list  in  the  lower  right  pane  and  then  connected 
to  operational  nodes  using  directed  edges.  The  operation  nodes  are  then  connected  to  “synthetic 
nodes”  which  the  user  drags  from  the  tool  bar  and  have  a  distinctive  brown  background.  The 
user  names  the  synthetic  nodes  the  same  way  as  concept  nodes  are  named  and  edited  in  the 
Modeling  screen.  When  the  user  presses  the  “Generate”  button,  all  the  synthetic  nodes  are 
computed  and  placed  in  a  folder  called  “[calculated  data]”  (see  Figure).  The  new  series  appears 
in  the  Data  Selection  screen  and  acts  like  any  regular  imported  data. 

The  user  can  also  create  Constant  nodes  that  allow  them  to  define  constant  numeric  values. 
This  is  done  using  the  n  (pi)  icon  on  the  toolbar. 

As  reported  last  quarter,  we  solved  the  problem  of  representing  and  executing  non- 
commutative  operations  like  subtraction  and  division.  Since  the  predecessor  nodes  in  the 
existing  graph  system  are  undifferentiated,  there  was  no  obvious  way  to  indicate  non- 
commutative  operations  in  a  single  step.  Originally,  we  planned  to  use  a  combination  of 
“multiply  by  -1”  and  ADD,  but  this  seemed  cumbersome.  The  new  design  uses  the  properties 
pane  to  indicate  which  inputs  are  subtracted  versus  added; 
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Label: 

Description: 

Operation 

Egypt,  Arab  Rep.:  G 
Egypt,  Arab  Rep.:  F 

Operation  Node 

Multiplication  /  Division 

MUL 

MULTIPLY 

DIVIDE 

Save 

Load 

Clear 

Generate 

Reset 

Figure  4.  New  graph  properties  design  showing  display  for  an  operator-type  node  in  data  synthesis 

In  this  design  multiplication  and  division  use  the  same  operator  which  has  a  “MUL”  label,  and 
likewise  addition  and  subtraction  use  the  same  operator.  When  inputs  are  connected  to  the 
operator  node  they  automatically  appear  in  the  properties  box  as  shown  above.  The  user  can 
then  select  either  “Multiply”  or  “Divide”  from  a  drop-down  list.  This  design  obviates  operators 
such  as  inversion.  In  the  new  design  inversion  is  achieved  by  using  a  multiply/divide  operator 
and  then  selecting  “/”  from  the  drop  down.  In  the  example  in  the  Figure  you  can  see  the 
operator  has  been  configured  to  MULTIPLY  by  GDP  and  DIVIDE  by  Foreign  Investment, 
creating  a  ratio. 

As  reported  last  quarter,  the  user  can  create  complex  expressions  using  the  Data  Synthesis 
capability.  Synthetic  outputs  can  be  hooked  up  to  other  operations  to  generate  new  outputs, 
creating  a  waterfall  of  calculations.  In  this  way,  many  different  synthetic  series  can  be 
generated  at  once.  If  the  series  already  exists  in  the  dataset,  it  is  updated  to  reflect  the  new 
values.  The  user  can  even  define  circular  calculations.  In  this  case,  the  Generation  action 
completes  one  loop  through  each  cycle  every  time  the  button  is  pressed.  This  allows  iterative 
computations  to  be  done. 

4.  Planned  Activities 

During  the  upcoming  reporting  period,  we  plan  to  focus  on  the  following  tasks: 

•  Testing,  debugging,  and  releasing  a  new  version  of  MAT  including  the  new 
functionality  developed  since  the  last  release  last  year. 

•  Continuing  to  improve  the  automated  model  learning  functionality  and  integrating  it 
with  the  data  synthesis  capability. 

•  Improved  access  to  the  MAT  causal  analysis  capabilities  through  the  user  interface. 

•  Beginning  work  on  data  validation. 

5.  Evaluation  and  Transition 

We  continue  to  focus  on  making  MAT  available  to  the  government  and  academic  research 
communities  and  to  look  for  opportunities  to  use  MAT  on  a  variety  of  ongoing  research  efforts. 
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During  this  reporting  period,  we  authored  a  paper  entitled  “A  Big  Data  Methodology  for 
Bridging  Qualitative  and  Quantitative  Political  Science  Research”  to  appear  at  the  American 
Political  Science  Association  Annual  Meeting  as  part  of  a  panel  on  Information  Technologies 
in  Politics  and  Political  Science.  The  theme  of  the  2014  meeting  is  Politics  after  the  Digital 
Revolution,  examining  the  way  the  modem  information  environment  affects  not  only  politics, 
but  the  ways  in  which  researchers  can  study  political  and  social  phenomena.  In  this  paper,  we 
present  the  MAT  methodology  as  a  means  for  both  qualitative  and  quantitative  political  science 
researchers  to  better  take  advantage  of  the  constantly  expanding  digital  data  environment. 

We  were  able  to  demonstrate  MAT  to  Dr.  Adam  Russell  from  lARPA  at  the  ONR  program 
briefing  in  June  and  talked  to  him  about  his  interest.  Unfortunately,  there  were  no  immediately 
obvious  uses  for  MAT  on  his  ongoing  projects,  but  we  will  continue  to  talk  with  him  through 
our  work  with  him  on  the  lARPA  SHARP  program  and  will  continue  to  look  for  opportunities 
to  support  him  and  his  work. 

During  this  period,  we  also  received  a  request  for  and  delivered  a  copy  of  MAT  to  a  postdoc  of 
Dr.  Arie  Kmglanski  at  the  University  of  Maryland. 

Table  1  summarizes  our  progress  in  this  regard  to  date.  We  will  continue  to  update  this  table  as 
we  make  additional  progress  and  will  include  it  as  a  regular  part  of  future  status  reports. 


Program 

Customer 

Comments 

On-going  efforts 

Tourniquet  Master  Trainer 
(TMT) 

(Phase  1  SBIR) 

US  Army's  Telemedicine  & 
Advanced  Technology  Research 
Center  (TATRC) 

MAT  is  being  used  to  visualize 
and  analyze  data  from  sensors 
on  a  medical  manikin  that 

indicate  whether  a  number  of 

novel  medical  devices  used  to 
combat  junctional  and  inguinal 
hemorrhaging  are  being  applied 
properly. 

This  program  is  about  to  begin  a 
Phase  II  where  MAT  will 
continue  to  be  used  both  by 
Charles  River  Analytics  and  our 
partners  at  the  University  of 
Wisconsin. 
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Laparoscopic  Surgery  Training 
System  (LASTS) 

(Phase  II  SBIR) 

US  Navy's  Office  of  Naval 

Research  (ONR) 

Under  lasts,  Charles  River  and 
Caroline  Cao  at  Wright  State 
University  are  using  MAT  to 
analyze  data  collected  from  the 
location  of  the  laproscopic 
surgery  tools  tools  during  an 
experiment.  Surgical  tools  are 
instrumented  with  markers  and 

3D  data  is  collected  on  their 
location  as  the  person  performs 
the  task. 

This  is  an  ongoing  Phase  II  SBIR 
program. 

Cognitive  Readiness  Agents  for 
Neural  Imaging  and 

Understanding  Models 
(CRANIUM) 

(Phase  1  SBIR) 

US  Navy's  Office  of  Naval 

Research  (ONR) 

MAT  was  used  to  visualize  and 
extract  patterns  of  stress  and 
workload  from  neuro¬ 
physiological  data  for  training 
systems. 

This  was  a  Phase  1  SBIR  program 
that  did  not  progress  to  Phase  11. 

Business  Intelligence 

Visualization  for  Organizational 
Understanding,  Analysis,  and 
Collaboration  (BIVOUAC) 

Phase  II  SBIR 

US  Navy's  Space  and  Naval 
Warfare  Systems  Command 
(SPAWAR) 

MAT  is  being  evaluated  as  part 
of  the  BIVOUAC  SBIR  program, 
which  provides  data  analysis 
and  visualization  for  Enterprise 
Resource  Planning  (ERP) 
systems  for  the  Navy. 

This  is  an  ongoing  Phase  II  SBIR 
program. 

Adaptive  toolkit  for  the 
Assessment  and  augmentation 
of  Performance  by  Teams  in 

Real  time  (ADAPTER) 

(Phase  1  SBIR) 

US  Air  Force  Research  Lab 

Human  Effectiveness 

Directorate  (AFRL/RH) 

MAT  is  being  used  to  analyze 
neuro-physiological  data  from 
cyber  operators  to  evaluate 
cognitive  workload  during  team- 
based  cyber  operations. 

This  program  has  been  chosen 
to  go  to  Phase  II  and  we 
awaiting  contract  award. 

Anticipated  Efforts 
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Enhancing  Intuitive  Decision 
Making  Through  Implicit 
Learning  (I2BRC) 

(ONR  Basic  Research  Challenge 
BAA) 

US  Navy's  Office  of  Naval 
Research  (ONR) 

Charles  River  is  a  subcontractor 
to  DSCI  MESH  Solutions,  LLC 

The  intention  is  to  use  MAT  to  help 
analyze  neuro-physiological  data  to 
help  better  understand  how 
implicit  learning  and  intuitive 
decision  making  work. 

This  is  an  ongoing  BAA  program, 
though  no  data  has  yet  been 
collected  to  analyze. 

A  system  for  augmenting 
training  by  Monitoring, 
Extracting,  and  Decoding 
Indicators  of  Cognitive  Load 
(MEDIC) 

US  Army's  Telemedicine  & 
Advanced  Technology  Research 
Center  (TATRC) 

We  are  evaluating  the 
practicability  of  using  MAT  to 
analyze  and  visualize  neuro¬ 
physiological  data  from  combat 
medic  trainees  to  identify  periods 
of  stress  and  cognitive  overload. 

This  is  a  SBIR  Phase  1  program 
where  MAT  is  being  evaluated.  The 
Phase  II  proposal  is  currently  being 
written. 

Soldier's  Intelligence  Fusion 
Toolkit  (SIFT) 

US  Army  Research  Laboratory 
(ARL) 

Extend  MAT  for  ARL  research 
objective  in  high-level  information 
fusion,  exploitation,  social  network 
analysis  and  knowledge 
management  research. 

A  BAA  white  paper  submission  has 
been  requested  and  has  been 
submitted. 

Table  1.  MAT  Transition  and  Use  Progress 

In  addition  we  have  provided  copies  of  MAT  to  the  following  institutions  based  on  their 
requests  for  the  software:  the  University  of  Michigan,  Arizona  State  University,  Kansas  State 
University,  University  of  California  at  Los  Angeles,  the  Naval  Medical  Research  Unit  at 
Wright  Patterson  Air  Force  Base,  Concordia  University  (Montreal),  the  University  of 
Wisconsin,  the  University  of  Maryland,  and  the  Air  Force  Research  Laboratory’s  Human 
Effectiveness  Directorate,  the  Intelligence  Advanced  Research  Projects  Agency  (lARPA),  and 
the  Joint  Advanced  Warfighting  Division  (JAWD). 

6.  Budget  and  Project  Tracking 

As  of  July  31,  2014,  we  have  spent  $621,833,  or  67%  of  our  total  budget  of  $928,224,  in  65% 
of  the  scheduled  time.  Our  current  funding  is  $662,477,  so  we  have  spent  94%  of  our  available 
funding. 
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Overall,  we  believe  we  are  in  good  shape  to  complete  the  project  on  time  and  on  budget. 
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